Robust-linear-model normalization to reduce technical variability in functional protein microarrays.

نویسندگان

  • Andrea Sboner
  • Alexander Karpikov
  • Gengxin Chen
  • Michael Smith
  • Dawn Mattoon
  • Lisa Freeman-Cook
  • Barry Schweitzer
  • Mark B Gerstein
چکیده

Protein microarrays are similar to DNA microarrays; both enabling the parallel interrogation of thousands of probes immobilized on a surface. Consequently, they have benefited from technologies previously developed for DNA microarrays. However, assumptions for the analysis of DNA microarrays do not always translate to protein arrays, especially in the case of normalization. Hence, we have developed an experimental and computational framework to assess normalization procedures for protein microarrays. Specifically, we profiled two sera with markedly different autoantibody compositions. To analyze intra- and interarray variability, we compared a set of control proteins across subarrays and the corresponding spots across multiple arrays, respectively. To estimate the degree to which the normalization could help reveal true biological separability, we tested the difference in the signal between the sera relative to the variability within replicates. Next, by mixing the sera in different proportions (titrations), we correlated the reactivity of proteins with serum concentration. Finally, we analyzed the effect of normalization procedures on the list of reactive proteins. We compared global and quantile normalization, techniques that have traditionally been employed for DNA microarrays, with a novel normalization approach based on a robust linear model (RLM) making explicit use of control proteins. We show that RLM normalization is able to reduce both intra- and interarray technical variability while maintaining biological differences. Moreover, in titration experiments, RLM normalization enhances the correlation of protein signals with serum concentration. Conversely, while quantile and global normalization can reduce interarray technical variability, neither is as effective as RLM normalization in maintaining biological differences. Most importantly, both introduce artifacts that distort the signals and affect the correct identification of reactive proteins, impairing their use for biomarker discovery. Hence, we show RLM normalization is better suited to protein arrays than approaches used for DNA microarrays.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Normalization of qPCR array data: a novel method based on procrustes superimposition

MicroRNAs (miRNAs) are short, endogenous non-coding RNAs that function as guide molecules to regulate transcription of their target messenger RNAs. Several methods including low-density qPCR arrays are being increasingly used to profile the expression of these molecules in a variety of different biological conditions. Reliable analysis of expression profiles demands removal of technical variati...

متن کامل

How to Choose a Normalization Strategy for miRNA Quantitative Real-Time (qPCR) Arrays

Low-density arrays for quantitative real-time PCR (qPCR) are increasingly being used as an experimental technique for miRNA expression profiling. As with gene expression profiling using microarrays, data from such experiments needs effective analysis methods to produce reliable and high-quality results. In the pre-processing of the data, one crucial analysis step is normalization, which aims to...

متن کامل

Removing technical variability in RNA-seq data using conditional quantile normalization

The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The recently developed RNA sequencing (RN...

متن کامل

ارزیابی پویشگر ریسک به منظور شناسایی ریسک‌های در حال ظهور با استفاده از مدل آنالیز تشدید کارکرد: مطالعه‌ی موردی در یک واحد فرایندی

  Background and aim: Today, it was revealed that Socio-technical systems did not have a bimodal nature and interactions in these systems are complex and non-linear. Consequently, since risks can be emerged as non-linear combinations of performance variability, so traditional methods of risk assessment are not able to capture these combinations. The present paper is aimed at identifying the eme...

متن کامل

Model selection and efficiency testing for normalization of cDNA microarray data by iterative local regression

We present in this study two novel normalization schemes for cDNA microarrays. They are based on iterative local regression and optimization of model parameters by generalized cross-validation. Permutation tests assessing the efficiency of normalization demonstrated that the proposed schemes have an improved ability to remove systematic errors and to reduce variability in microarray data. The a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of proteome research

دوره 8 12  شماره 

صفحات  -

تاریخ انتشار 2009